arXiv: 1505.023 18vl [cs.DM] 9 May 2015 



Parameters for minimal unsatisfiability: 
Smarandache primitive numbers and full clauses* 



Oliver Kullmann 1 and Xishun Zhao 2 

1 Computer Science Department, Swansea University 
2 Institute of Logic and Cognition, Sun Yat-sen University 



Abstract. We establish a new bridge between propositional logic and 
elementary number theory. A full clause contains all variables, and we 
study them in minimally unsatishable clause-sets (MU); such clauses 
are strong structural anchors, when combined with other restrictions. 
Counting the maximal number of full clauses for a given deficiency k, 
we obtain a close connection to the so-called “Smarandache primitive 
number” S 2 (k), the smallest n such that 2 fc divides n\. 

The deficiency k > 1 of an MU is the difference between the number of 
clauses and the number of variables. We also consider the subclass UHIT 
of MU given by unsatishable hitting clause-sets (every two clauses clash). 
We study the four fundamental quantities FCH, FCM, VDH, VDM : N —¥ 
N, defined as the maximum number of full clauses in LTHIT resp. MU, 
resp. the maximal minimal number of occurrences of a variable (the 
variable degree) in UHIT resp. MU, in dependency on the deficiency. 
We have the relations FCH(fc) < FCM(fc) < VDM(fc) and FCH(fc) < 
VDH(fc) < VDM(fc), together with VDM(fc) < nM(fc) < k + 1 + log 2 (fc), 
using the “non-Mersenne numbers” nM(fc) as established in [21] , 

We show the lower bound ^(fc) < FCH(fc); indeed we conjecture this 
to be exact. The proof rests on two methods: Applying subsumption 
resolution and its inverse, and analysing certain recursions, combining 
an application-specific recursion with a recursion from the held of meta- 
Fibonacci sequences. The 5'2-lower bound together with the nM-upper- 
bound yields a good handle on the four quantities, which we determine 
for 1 < k < 13. 



1 Introduction 

Long clauses often occur in practical instances; we study the most extreme 
case, the occurrences of full clauses, clauses of maximal possible length, in min- 
imal unsatishable clause-sets (F £ MU). The main parameter is the deficiency 
5(F) = c(F) — n(F) > 1, the number of clauses minus the number of variables. 
We denote by FCM(fc) the maximal possible number of full clauses in F £ MU 
with 5(F) = k (short: F £ MUs-k)- From [21] Theorem 15] follows the up- 
per bound FCM(fc) < nM(fc) for the non-Mersenne numbers nM(fc) £ N, with 
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k + [log 2 (fc+ 1)J < nM(fc) < k+ 1 + |k>g 2 (fc)J ([21] Corollary 10]). Until now no 
general lower bound on FCM(fc) was known, and we establish 52 (fc) < FCM(fe). 
Here 5 2 (fc), as introduced in [28] . is the smallest n £ No such that 2 k divides n!, 
and various number-theoretical results on 5 2 and the generalisation S p for prime 
numbers p are known. Actually we show a stronger lower bound, namely we do 
not consider all F £ MUg=k, but only those F which are hitting (F £ UHIT), 
that is, where every two clauses clash, yielding FCH(fc) with FCH(fc) < FCM(fc), 
and we show 5 2 < FCH. The elements oiUHIT are known in the DNF language 
as “orthogonal” or “disjoint” tautological DNF, and when considering arbitrary 
boolean functions, then also “disjoint sums of products” (DSOP) or “disjoint 
cube representations” are used; see [25] Section 4.4] or [4] Chapter 7]. 

The central underlying research question is the programme of classification 
of MU in the deficiency, that is, the characterisation of the layers MUg= k for 
k £ N. A special case of the general classification is the classification ofUHITs=k- 
The earliest source [T] showed (in modern notation) 6(F) > 1 for F £ MU, and 
characterised the special case SMUg=i C MU$= i, where SMU C MU contains 
those F £ MU such that no literals can be added to any clauses without de- 
stroying unsatisfiability. Later [5] characterised MUs=i via matrices, while the 
intuitive characterisation via binary trees was given in |161 Appendix B], where 
also SMU$=i = UHXTs=\ has been shown. In the form of “5-matrices”, the 
class MUs=i had been characterised earlier in Pin] , going back to a conjecture 
on Qualitative Economics (0), and where the connections to this field of ma- 
trix analysis, called “Qualitative Matrix Analysis (QMA)”, where first revealed 
in [18] (see [15] Subsection 11.12.1] and [23] Subsection 1.6.4] for overviews). 
SMUs =2 and partially MUg =2 were characterised in im. with further informa- 
tion on MUs=2 in mm showed that all layers MUg= k are poly-time decidable. 

A key element for these investigations into the structure of MU is the min- 
var-degree pvd(F) := min^ gvar (^ \ |{C £ F : v £ var(C')}|, the minimal variable- 
degree of F, and its maximum VDM(fc) over all F £ MUg=k- Indeed the key to 
the characterisation of MUg—i in [5] as well as in m was the proof of VDM(l) = 
2. The first general upper bound Vfc £ N : VDM(fc) < 2k was shown in [T6J 
Lemma C.2]. Now in |21j . mentioned above, we actually showed the upper bound 
VDM(fc) < nM(fc). Using fc(F) for the number of full clauses in F, obviously 
fc (F) < /ivd (F) holds, and FCM(fc) is the maximum of fc(i r ) over all F £ 
MU s=k , thus FCM(fc) < VDM(jfe). 

For the variation VDH(fc) < VDM(fc), which only considers hitting clause- 
sets, we conjecture VDH(fc) = VDM(fc) for all k > 1. Furthermore we conjecture 
FCM(fc) > nM(fc) — 1, and thus the quantities nM(fc), VDM(fc), VDH(fc), FCM(fc) 
are believed to have at most a distance of 1 to each other. On the other hand we 
conjecture FCH(fc) = 5 2 , where 5 2 (fc) oscillates between the linear function k + 1 
and the quasi- linear function k + 1 + [log 2 (fc)J ■ Altogether the “four fundamen- 
tal quantities” FCH, FCM, VDH, VDM are fascinating and important structural 
parameters, whose study continues to reveal new and surprising aspects oi MU 
and UHIT. 
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It is also possible to go beyond MU\ in [23l Section 9] it is shown that when 
considering the maximum of gvd(F) over all F £ ££AAfs=k £> M.Us=k , the set 
of all “lean” clause-sets, that then nM(fc) is the precise maximum for all k > 1. 
Lean clause-sets were introduced in m as the clause-sets where it is not possible 
to satisfy some clauses while not touching the other clauses, and indeed were 
already introduced earlier, as “non- weakly satisfiable formulas (matrices)” in the 
field of QMA by [T2|. Furthermore it is shown in :23l Section 10], that there is 
a polytime “autarky reduction” , removing some clauses which can be satisfied 
without touching the other clauses, which establishes for arbitrary clause-sets F 
the upper bound [ivd(F) < nM(5(i 7 ')); an interesting open question here is to 
find the witnessing autarky in polynomial time. 

Back to the main result of this paper, the proof of S 2 < FCH is non-trivial. 
Indeed the proof is relatively easy for a function S' 2 (k) defined by an appropriate 
recursion, motivated by employing full subsumption extension C CU{v}, FU 
{U} in an optimal way. Then the main auxiliary result is S 2 = S 2 - For that we 
use another function, namely 02 (fc) as considered in [24| in a more general form, 
while ci 2 was introduced with a small modification in [3]. These considerations 
belong to the field of meta- Fibonacci sequences , where special nested recursions 
are studied, initiated by (U Page 145]. Via a combinatorial argument we derive 
such a nested recursion from the course-of- value recursion for S 2 , which yields 
S 2 = 2a 2 . We also show 2o 2 = S 2 (this equality was conjectured on the OEIS 
im and we obtain S 2 = S 2 - 

Overview. The main results of this paper are as follows. Theorem [18] proves 
S 2 = 2 a 2 . Theorem M shows a meta-Fibonacci recursion for S 2 , where S 2 is 
introduced by a recursion directly related to our application. Theorem [33] then 
proves S 2 = 5' 2 . After these number-theoretic preparations, we consider sub- 
sumption resolution and its inversion (extension); Theorem 1381 combines sub- 
sumption extension and the recursion machinery, and shows S 2 < FCH. In the 
remainder of the paper, this fundamental result is applied. Theorem 1391 proves a 
tight upper bound on S 2 , while Theorem 02] considers the cases where the lower 
bound via S 2 and the upper bound via nM coincides. Finally in Theorem 1471 we 
determine the four fundamental quantities for 1 < k < 13 (see Table [l]). 

2 Preliminaries 

We use Z for the set of integers, No := {n £ Z : n > 0}, N := No \ {0}, and 
finally M. D Z is the set of real numbers. For maps /, g : X — »• R we write / < g 
if Va- £ X : f{x) < g(x). 

On the set CIT of literals we have complementation x £ CIT x , with 
x ^ x and x = x. We assume Z \ {0} C CIT , with z = — 2 for z £ Z \ {0}. 
Variables VA C CIT with N C VA are special literals, and the underlying 
variable of a literal is given by var : CIT — » VA, such that for v £ VA holds 
var(v) = var(U) = v , while for x £ CIT \ VA holds x = var(s). For a set 
L C CIT we define L := {x : x £ L}. A clause is a finite set C of literals with 
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C n C = 0 (C is clash- free). A clause-set is a finite set of clauses, the set of all 
clause-sets is denoted by CCS. 

For a clause C we define var(C) := {var(:r) : x € C} C VA, and for a 
clause-set F we define var (F) := (J ceF var (^') c VA We use the measure 
n(F) := |var(F)| £ No and c(F) := \F\ £ No, while the deficiency is 5(F) := 
c(F) — n(F) £ Z. 

The set of satisfiable clause-sets is denoted by SAT C CCS , which is the set 
of clause-sets F such that there is a clause C which intersects all clauses of F, i.e. , 
with V D £ F : CTiD 0; the unsatishable clause-sets are US AT := CCS\SAT ■ 
The set MU C US AT of minimally unsatishable clause-sets is the set of 
F £ USAT, such that for F' C F holds F' £ SAT. The unsatishable hitting 
clause-sets are given by WHIT := {F £ USAT | V C, D £ F,C T F> : C fl D ^ 
0}. It is easy to see that UHIT £ MU holds, and that for all F £ UHIT holds 
T~\ c \ = 1. While all definitions are given in this paper, for some more 
background see [IK] . 

2.1 Full clauses 

A full clause for F £ CCS is some C £ F with var(C) = var(F) (equivalently, 
|C| = n(F)), and the number of full clauses is counted by fc : CCS — > No, which 
can be dehned as fc (F) := c(FflA(var(F))), and where A(V) £ UHIT for some 
hnite V C VA is the set of all clauses C with var(C') = V. Standardised versions 
of the A(V) are A n := A({1, . . . ,n}) for n £ No- The following observation is 
contained in the proof of El Utterly Trivial Observation]: 

Lemma 1. For F £ UHIT , F T {-L}, the number fc(F) of full clauses is even. 

Proof. Let n := n(F). We have YlceF = 2 n , and thus is 

even (due to n > 0). Since J2cgf \C\^n i s even, the assertion follows. □ 

2.2 The four fundamental quantities 

For F £ CCS we dehne the var-degree as vd^(v) := c({C £ F : v £ var(C)}) £ 
No for v £ VA, while in case of var(F) T 0 F £ {T, {-L}}) we dehne the 
min-var-degree fivd(F) := min„ gvar ( F ) vd^ (u) £ N. 

Definition 2. For k £ N let 

— FCH(fc) £ N be the maximal fc(F) for F £ UHITs-k / 

— FCM(fc) £ N be the maximal fc (F) for F £ MUs=k,' 

— VDH(fc) £ N be the maximal /ivd(F’) for F £ UHITs-k', 

— VDM(fe) £ N be the maximal /ivd (F) for F £ MU$ = k- 

For k = 1 the case F = {_L} is excluded in the last two definitions. 

By [21 Lemma 9, Corollary 10, Theorem 15]: 
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Theorem 3 (|21j). VDM(fc) < nM(fc) = k + [log 2 (fc + 1 + [log 2 (fc + 1)J)J < 
k + 1 + |tog 2 (fc)J for all k G N. 

Here nM : N — > N is the enumeration of natural numbers excluding the Mersenne 
numbers 2 n — 1 for n G N; the list of initial values is 2,4,5,6,8,9,10,11,12, 
13,14,16,17 (http://oeis.org/A062289). In [23] Theorem 14.4] it is shown 
that VDM(6) = 8 = nM(6) — 1, extending this to an improved upper bound 
VDM < nMi ( [23, Theorem 14.6], where nMj : N — >■ N can be defined as 
follows: nMi(fc) := nM(fc) for k G N with k ^ 2” — n + 1 for some n > 3, while 
nM^" — n + 1) := nM(2™ — n + 1) — 1 = 2 n ; see Table |T] for initial values. 

Theorem 4 (|23j). For k G N holds VDM(fc) < nMi(fc) < nM(fc). 

We conclude these preparations with a special property of FCH(fc) (support- 
ing our Conjecture [48] that FCH = S 2 ), namely by Lemma[l]we have: 

Corollary 5. FCH(/c) is even for all k GN. 



3 Some integer sequences 

We review the “Smarandache primitive numbers” S 2 (k) and the meta-Fibonacci 
sequences a 2 (fc). We show in Theorem [151 that S 2 = 2 a 2 holds. 

3.1 Some preparations 

We define two general operations a i-> Aa and a i— >• fp a for sequences a. First 
the (standard) Z\-operator: 

Definition 6. For a : I —> M, where I C Z is closed under increment , we define 
Aa : I — > K by Aa(k) := a(k + 1) — a(k). 

So a is monotonically increasing iff Aa > 0, while a is strictly monotonically 
increasing iff Aa > 1. Sequences with exactly two different A- values, where one 
of these values is 0, play a special role for us, and we call them “d-Delta”, where 
d is the other value: 

Definition 7. A sequence a : No — > Z is called d- Delta for d G Z \ {0}, if 
Z\a(N 0 ) = {Z\a(n)}„ eNo = {0,d}. 

While the zl-operator determines the change to the next value, the plateau- 
operator determines subsequences of unchanging values: 

Definition 8. For a sequence a : N — > K which is non- stationary (for all i 
there is j > i with aj ^ ai) we define a : N — > N (the “plateau operator ”) by 
letting a(n) for n G N be the size of the n-th (maximal) plateau of equal values 
(maximal intervals ofN where a is constant). 

So ^^(l) is the size of the first plateau, < ]3a(2) the size of the second plateau, 
and so on; Vi G N : a(i) a(i + 1) iff a is the constant 1-function. For a 
d-Delta sequence a from ‘jf a and the initial value cq we can reconstruct a. 
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3.2 Smarandache primitive numbers 

The “Smarandache Primitive Numbers” were introduced in [28j Unsolved Prob- 
lem 47]: 

Definition 9. For k £ No let S^fc) be the smallest n £ No such that 2 k divides 
n\. Using ord 2 (n), n £ N, for the maximal m £ No such that 2 m divides n, we 
get that S^fc) for k £ No is the smallest n £ No such that k < ]T]” =1 ord 2 (i). 

So 52(0) = 0, and AS^No) = {0,2}. The following is well-known and easy to 
show (see Subsection III. 1 in [9] for basic properties of 82 (h)): 

Lemma 10. The sequence £ 2 ( 1 ), £' 2 ( 2 ), 3), ... is obtained from the sequence 

1,2,3,... of natural numbers, when each element n £ N is repeated ord 2 (n) 
many times. 

Example 11. The numbers .Sa ( A: ) for k £ (1, . - . , 25} are 2,4,4,6,8,8,8,10,12, 
12, 14, 16, 16, 16, 16, 18, 20, 20, 22, 24, 24, 24, 26, 28, 28. The corresponding OEIS- 
entry (which has 1 as first element (index 0), instead of 0 as we have it, and 
which we regard as appropriate) is http : / / oeis . org/A007843. 

Lemma 12 (|30j). For k £ N holds k + 1 < S 2 {k) = k + 0(log k). 

We give an independent proof for the lower bound in Lemma tlOl while we sharpen 
the upper bound in Theorem 1391 For more number-theoretic properties of S 2 see 
[29] . To understand the plateaus of S 2 , we need the ruler function: 

Definition 13. Let ru„ := ord2(2n) £ N for n £ N. 

The plateaus of S 2 are given by the ruler function: in Lemma [TO] we deter- 
mined the number of repetitions of values »£Nas ord 2 (v ) , while for the plateaus 
we skip zero-repetitions, which happen at each odd number, and thus for the 
associated index n we have n = \ for even v, and the number of repetitions is 
ord 2 (w) = ord 2 ( 2 ?x); we obtain 

Lemma 14. ^(S 2 (k)) k& n = (ru„) neN - 

3.3 Meta-Fibonacci sequences 

Started by [8] Page 145], various nested recursions for integer sequences have 
been studied. Often the focus in this field of “meta-Fibonacci sequences” is 
on “chaotic behaviour” , but we consider here only a well-behaved case (but in 
detail): 

Definition 15. In J2f | / the sequence 02 '■ No — > Nrl^l. has been defined recursively 
via 

a, 2 (k) = 02 {k — a 2 (k — 1)) + a 2 (k — 1 — 02 (k — 2)), 
while 02 (h) := k for k £ {0, 1}. 



hiding two parameters d £ N, s £ Z used in |24|, which are d = 2, s = 0 in our case 
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The sequence 02 was introduced in [3] as F : N — > No, with F(k) = fc — 1 for 
k € {1, 2} and the same recursion law, which yields F(k) = 02 (k — 1) for k £ N. 
Furthermore, using F'(l) = F'{ 2) = 1 as initial conditions does not change 
anything else, and this sequence is the OEIS entry http://oeis.org/A046699 
It is shown (in our notation): 

Lemma 16 ([3]). For k G N and p := |_log 2 (fc + 1)J : 02 (h) = 2 P_1 + a 2 (fc + 1 — 
2 P ). 

Lemma fl6l yields a fast computation of 02 (h). [TO] Corollary 2.9, Equation (1)] 
determines the plateau sizes: 

Lemma 17 ( |10j ). a 2 is a 1-Delta sequence with < P(a 2 (fc))fe g N = ru. 

We can now show a 2 = ySa, which has been conjectured on the OEIS 
(http://oeis.org/A007843, by Michel Marcus): 

Theorem 18. \/k £ No : S 2 (fc) = 2 • 02 (h). 

Proof. By Lemma H4l and Lemma H71 together with S 2 (0) = a 2 (0) = 0. □ 



4 Recursions for Smarandache primitive numbers 

In Subsection 14.11 we introduce the sequence S' 2 via a recursive process, which 
directly ties into our main application in Theorem [38] for constructing unsatis- 
fiable hitting clause-sets with many full clauses. This recursive definition uses 
an index, which is studied in Subsection The central helper function is the 
“slack”, studied in Subsection l4.3l We then prove a meta-Fibonacci recursion in 
Theorem [31] and obtain = ^2 in Theorem [33] 



4.1 A simple course-of- values recursion 
Definition 19. For h € No let 

1. S^O) := 0, /^(l) := 2/ and for k > 2: 

2. S' 2 (k) := 2-(k — i + l) for the minimal i G (1, . . . , k— 1} with k — i + 1 < S 2 (i). 

Note that the recursion step is well-defined (the i exists), since for i = k — 1 
holds k — i + 1 = 2, and S 2 (k — 1) = 2 for k = 2, while for k > 3 holds 
S' 2 (k - 1) = 2 ■ ((* - 1) - *' + 1) > 2 • ((jfc - 1) - ((k - 1) - 1) + 1) = 4. The 
condition “fc — i + 1 < S 2 (i)” is equivalent to k + 1 < i + S 2 (i). Some simple 
properties are that S 2 (k ) is divisible by 2, S 2 (k) > 2 for k > 1, and S '2 (2) = 4 
and S 2 (k) > 4 for k > 2. 
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4.2 Analysing the index 

Definition 20. For k > 0 let is (fc) := k + 1 — 1 £ N. 

Simple properties (for all k > 0): 

1. S' 2 (k) = 2-(k-i s (k) + l). 

2. ig(0)=ig(l)=i s (2) = l. 

3. Aig(k) = 0 <t4 AS 2 (k) = 2 and Z\ig(fc) = 1 <t=> AS 2 (k ) = 0. 

An alternative characterisation of ig (fc): 

Lemma 21. For k > 0: is(fc) is the minimal i £ No with i + S' 2 (i) >k + 1. 

Proof. The assertion follows by what has already been said above, plus the 
consideration of the corner cases: 0 + 5^(0) = 0 < k + 1 for all k > 0, while 
1 + ^(1) =3>fc + l for k<2. □ 

We obtain a method to prove lower bounds for S 2 (k ): 

Corollary 22. For k,i £ No with S 2 {i) > k — * + 1 holds S 2 (k) > 2(k — i + 1). 

is(fc) grows in steps of +1, while S' 2 (k) grows in steps of +2: 

Lemma 23. AS' 2 (k) € {0,2} and Aig(fc) G {0,1} for all k £ No- 

Pi'oof. Proof via (simultaneous) induction on k : The assertions hold for k < 1, 
and so consider k > 2. Now ig (k) is the minimal i £ {1, . . . , k — 1} with k + 1 < 
i + S 2 (i ), and due to AS 2 (i) > 0 for all * < k it follows Z\ig(/c) £ {0, 1}. □ 

We obtain a simple upper bound on ig: 

Corollary 24. For k > 1 holds is (k) < k and for k >2 holds is (fc) < k — 1 

4.3 The “slack” 

An important helper function is the “slack” sls(fc): 

Definition 25. For k £ No let sis (k) := (ig (k) + S 2 (is{k))) — (k + 1) £ No- 

So sls(0) = (1 + 2) — (0 + 1) = 2 and sls(l) = (1 + 2) — (1 + 1) = 1. Directly 
from the definition follows: 

Lemma 26. For k > 0 holds S 2 (is(k)) = \S 2 (k ) +sls(fc). 

We can characterise the cases Aig(fc) = 1 as the “slackless” k's: 

Lemma 27. For k > 0: 

1. A is(fc) = 1 sls(fc) = 0 AS 2 (k) = 0. 

2. Aig{k) = 0 <t^> sls(fc) > 1 AS 2 (k) = 2. 
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Proof. If sls(fc) > 1, then Z\is(fc) = 0 by Lemma [Til while for sis (A) = 0 we get 
A is(k) > 1. □ 

Thus the slack determines the growth of S 2 : 

Corollary 28. For k > 0 holds AS' 2 (k) = 2 • min(sls(fc), 1). 

And plateaus of the slack happen only for slack zero, and from such a plateau 
the slack jumps to 2, and then is stepwise again decremented to zero: 

Corollary 29. For k > 0 holds: 

1. If sls(fc) > 0, then sis (k + 1) = sls(fc) — 1. 

2. //sls(fc) = 0, then sis (fc + 1) € {0,2}. 

4.4 A meta-Fibonacci recursion 

We are ready to prove an interesting nested recursion for S 2 . First a combina- 
torial lemma, just exploiting the fact that the shape of the slack repeats the 
following pattern (Corollary [29]): a plateau of zeros, followed by a jump to 2 and 
a stepwise decrement to 0 again (where right at k = 0 we start with sls(0) = 2): 

Lemma 30. For k > 2 holds J2i = i sis (^ — i) = i * ' min(l, sls(fc — *)). 

Proof. There are 0 < p < 2 and 1 < q < 3 such that the left-hand side is 

p {p — 1)H bl + OH (-0 + 2 + (2 — 1) + •••-)- Q’, 

for p = 0 the initial part is empty, for q = 3 the final part is empty. Let r > 0 

be the number of zeros; so r = 0 iff p = 2 (and then also q = 3). We have 

p + r + (2 — q + 1) =2, i.e., p + r + 1 = q. Now the right-hand side is 

1 + 2 — |— • • • — |- — |— 0 — f — - - - — (- 0 — |— — |— (<7 — |— 1 ) — |— - - - — |— 2 , 

and we see that both sides are equal. □ 

Theorem 31. For k >2 holds 

2 

S! 2 (k) = J2s , 2 (is(k-i)) 

i= 1 

(note that by Lemma ^J\ holds is (k — i) <k). 

Proof. By Lemma [26] and Lemma l30l holds 

2 2 1 2 

E 5 2(is(fc - 0 ) = (E sl s( k - 0 ) + S' 2 (k) - E(^2 (k) - S' 2 (k - i)) = 

i= 1 i= 1 i= 1 

2 1 2 i-l 

S 2 (k) + min(l,sl s {k - i))) - + 

2—1 2=1 j = 0 



10 



where now by Corollary l28l holds Y%= 1 Y?j = o AS 2 (k + i — j) = ( AS' 2 (k — 1)) + 

(AS 2 (k - 2) + AS 2 (k - 1)) = Ya= i* ' AS 2 (k - 1) = 2]C- =1 * ' min (l, sl s (fc)), 
which completes the proof. □ 

Now we see that S' 2 is basically the same as a 2 (recall Subsection 13.31) : 

Corollary 32. Vfc G N 0 : S 2 (k) = 2 • a 2 (k). 

Proof. For the purpose of the proof let a 2 (k) := \ S 2 (k ) for k £ No- So we get 
a 2 {k) = k for k £ {0, 1}, while is (k) = k + 1 — a 2 (k), and thus for k > 2: 

2 2 

«2 (k) = ^ S 2 (k ) = ^2 (is (k = a 2 (is(k - i)) = 

Z i—1 i= 1 

2 

a 2 (fc — * + 1 — a 2 (fc — i)), 

i= 1 

and so the assertion follows by the equations of Definition [15] □ 

We obtain the main result of this section: 

Theorem 33. S 2 = S 2 ( recall Definition 05 

Proof. By Corollary 15121 and Theorem [T8l □ 



5 On the number of full clauses 

First we review full subsumption resolution, C U {v},C U {u} C, and its 
inversion, called “extension” in Section EU where some care is needed, since 
we need complete control. From a clause-set F with “many” full clauses we 
can produce further clause-sets with “many” full clauses by full subsumption 
extension done in parallel, and this process of “full expansion” is presented 
in Definition 1361 The recursive computation of S 2 via Definition [T9] captures 
maximisation for this process, and so we can show in Theorem 1381 that we can 
construct examples of unsatisfiable hitting clause-sets F of deficiency k and 
with S 2 (k) many full clauses. It follows that S 2 yields a lower bound on FCH 
(Conjecture [48] says this lower bound is actually an equality). 

5.1 Full subsumption resolution 

As studied in [23] Section 6] in some detail: 

Definition 34 ([23])- A full subsumption resolution for F £ CCS can be 

performed, if there is a clause C F with CU{u}, CU{v} £ F for some variable 
v, and replaces the two clauses C U {u}, C U {u} by the single clause C. For the 
strict form, there must exist a thnd clause D £ F \ {C U {v},C U {u}} with 
v £ var (D), while for the non-strict form there must NOT exist such a third 
clause. 
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If F' is obtained from F by one full subsumption resolution, then c(F') = c(F) — 
1; we have the strict form iff n(F') = n(F), or, equivalently, S(F') = 6(F) — 1, 
while we have the non-strict form iff n(F') = n(F) — 1, or, equivalently, d(F') = 
5(F). A very old transformation of a CNF (DNF) into an equivalent one uses 
the inverse of full subsumption resolutiorFl 

Definition 35 ([[23]). A full subsumption extension for F £ CCS and a 

clause C £ F can be performed, if there is a variable v £ VA \ var(G) with 
C U {u}, G U {u} ^ F, and replaces the single clause C by the two clauses C U 
{u}, G U {u}. For the strict form we have v £ var (F), while for the non-strict 
form we have v ^ var (F). 

If we consider F £ MU and C £ F, then we can always perform a non-strict full 
subsumption extension, while we can perform the strict form iff G is not full. If 
we denote the result by F' , then for F £ WRIT we have again F' £ WHIT , but 
for general F £ MU we might have F' MU; see [23j Lemma 6.5] for an exact 
characterisation. 

5.2 Full expansions 

We now perform full subsumption extensions in parallel to m full clauses of F, 
first using a non-strict extension, and then reusing the extension variable via 
strict extensions: 

Definition 36. For F £ CCS and m £ N, where fc(F) > m, a full m- 
expansion of F is some G £ CCS obtained by 

1. choosing some F'Cf’n A(v&r(F)) with c(F') = m, 

2. choosing some v £ VA \ va r (A) (the extension variable), 

3. and replacing the clauses C £ F' in F by their full subsumption extension 
with v (recall Definition \35\) . 

The choice of v in Definition [301 is irrelevant, while the choice of F' might 
have an influence on further properties of G, but is irrelevant for our uses. The 
following basic properties all follow directly from the definition: 

Lemma 37. Consider the situation of Definition \36l 

1. There is always a full m-expansion G (unique for any fixed F' , v). 

2. If F £ UHIT, then G £ UHIT. 

3. n(G) = n(F ) + 1, c(G) = c(F) + m. 

4- 5(G) = 5(F) + to — 1. 

5. fc(G) = 2 • to.. 

We turn to the construction of unsatisfiable hitting clause-sets with many 
full clauses (for a given deficiency): 

4 ) Boole introduced in [2], Chapter 5, Proposition II, the general “expansion” f(v, x) = 
(/( 0, a;)AU)V(/(l, x)Av) for boolean functions /, where for our application f(v, x) ss 
C. This was taken up by [26], and is often referred to as “Shannon expansion”. 
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Theorem 38. For k £ N we recursively construct F k £ WHTTs= k as follows: 

1. F\ := {{1}, {— 1}}. 

2. For k >2 let Fk be a full 02(h) -expansion of F- ls ( k y 

Then we have ic(F k ) = £ 2 (A;) ■ Thus Vfc G N : £ 2 (fc) < FCH(fc). 

Proof. If the construction is well-defined, then we get ic(F k ) = 2 • 02 (h) = S 2 (k) 
and S(F k ) = 6(F ls ( k )) + 02(h) — 1 = is(fc) + a 2 (fc) — 1 = k for k > 2 by Lemma [371 
(using Theorem [531 freely) , while these two properties hold trivially for k = 1 . 

It remains to show that 1 < is(fc) < k — 1 and a 2 (k) < ic(F- ls ( k )) for k >2. 
The first statement follows by Corollary [24] while the second statement follows 
by Lemma I2T1 □ 

6 Applications 

We start by sharpening the upper bound from Lemma IT2l 
Theorem 39. For k £ N holds S 2 (k) < nM(fc) < k + 1 + [log 2 (fc)J ■ 

Proof. By Theorem 1881 and Theorem [3] □ 

We can also provide an independent proof of the lower bound of Lemma 1121 
Lemma 40. For k £ N holds 82(h) > k + 1. 

Proof. We prove the assertion by induction. For k = 1 we have £ 2 ( 1 ) = 2, so 
consider k > 2. We use Corollary[22l and so we need i£tf with fc+1 < 2(k—i+l), 
i.e., i < So we choose i := G N. We have i < k, and so we can apply 

the induction hypothesis to i: i + £ 2 (*) = L^ 2 ^J + > L^ 2 ^J + L^ 2 ^J + 

1 = 2|_^2-J + 1 > 2(^±I — 1) + 1 = k, and thus i + £ 2 ( 2 ) > k +1. □ 

When upper and lower bound coincide, then we know all four fundamental 
quantities; first we name the sets of deficiencies (recall Theorems [SJ [l]) : 

Definition 41. SAfM := {k £ N : S 2 (k) = nM(fc)}, SMM X := {k £ N : 
S 2 (k) = nM 1 (fe)}. 

By £2 < VDM < nMi < nM we get SMM C and: 

Theorem 42. For k £ SNM\ holds £ 2 (fc) = FCH(fc) = FCM(fc) = VDH(fc) = 
VDM(fc) = nM x (k). 

We prove now that the special deficiencies 2 n — n, 2 n — n — 1 (n > 1; note 
S(A n ) = 2 n — n) considered in [23] Lemmas 12.10, 12.11], where we have shown 
that for them the four fundamental quantities coincide, are indeed in SAfA4, and 
that furthermore the special deficiencies 2 n — n + 1 (n > 3), where nMj differs 
from nM, are in SMM\. 

Lemma 43. Consider nSN. 
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1. S 2 (2 n -n) = 2 n , and for k G N 0 holds S 2 (k) = 2 n o 2 n - n < k < 2 n - 1. 

2. 2 n - n G SJ\fM, while 2" - n + 1, . . . , 2" - 1 £ SMM. 

3. Assume n >2 now. Then 2 n — n — 1 G SJ\fA4 with S 2 (2 n — n — 1) = 2 n — 2. 
f. For n > 3 holds 2 n — n + 1 € SMM\ . 

Proof. By [23] Corollary 7.24] we have nM(2"-n) = 2 n , while nM(2"-n - 1) = 
2 n — 2 (remember that the jumps for nM happens at the deficiencies 2 n — n). 
Thus S 2 {2 n — n) < 2 n and S 2 (2 n — n — 1) < 2" — 2. Since for the value 2" 
the sequence S 2 has a plateau of length n iLemrna flTTl) . while nM is strictly 
increasing, for Parts [L] [ 2 ] [ 3 ] it remains to show S 2 {2 n — n) > 2 n . We show this 
by induction: For n = 1 we have S 2 (1) = 2 = 2 1 , while for n > 2 by induction 
hypothesis we have (2 n — n) — (2 n_1 — (n — 1)) + 1 = 2 n ~ 1 < S 2 (2 n ~ 1 — (n — 1)), 
thus by Corollary [22] S 2 (2 n — n) > 2 • 2" _1 = 2". Finally, for Part [4] we note 
S 2 (2 n — n + 1) = S 2 {n ) = 2" by PartH] while nMj(fc) differs from nM(fc) exactly 
at the positions k = 2" — n+ 1 for n > 3, where then nMi(fc) = nM(fc) — 1 = 2 n 
1 |231 Theorem 14.7]). □ 

So the lower bound of Lemma |40l is sharp for infinitely many deficiencies: 

Corollary 44. We have S 2 (k ) = k + 1 for all k = 2 n — 1, n G N. 

7 Initial values of the four fundamental quantities 

The task of this penultimate section is to prove the values in Table[l](in Theorem 
SZl of course, only the four fundamental quantities are open). 



k 


1 


2 


3 


4 


5 


6 


7 


8 


9 


10 


11 


12 


13 


nM(fc) 


2 


4 


5 


6 


8 


9 


10 


11 


12 


13 


14 


16 


17 


nMi(Jfe) 


2 


4 


5 


6 


8 


8 


10 


11 


12 


13 


14 


16 


16 


VDM (k) 


2 


4 


5 


6 


8 


8 


10 


11 


12 


13 


14 


16 


16 


VDH (k) 


2 


4 


5 


6 


8 


8 


10 


11 


12 


13 


14 


16 


16 


FCM (k) 


2 


4 


4 


6 


8 


8 


9 


10 


12 


12 


14 


16 


16 


FCH (k) 


2 


4 


4 


6 


8 


8 


8 


10 


12 


12 


14 


16 


16 


S 2 (k ) 


2 


4 


4 


6 


8 


8 


8 


10 


12 


12 


14 


16 


16 



Table 1. Values for the fundamental quantities for 1 < k < 13; in bold the columns 
not in SAfMi, while the vertical bars are left of the special deficiencies 2 n — n, n > 2. 



Strengthening [23] Corollary 12.13], first we show strong properties for min- 
imally unsatisfiable clause-sets F such that the number of full clauses equals 
the min-var-degree, i.e., there is a variable which occurs only in the full clauses. 
We use var ^{F) := (c G var(F’) : vdF(u) = /xvd (F)} for F G CCS with 
n(F) > 0 (the set of variables with minimal degree). Furthermore we use DP„(F) 
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for F £ CCS and v £ var (F) for the result of replacing the clauses containing 
variable v by their resolvents on v, indeed the special use in Lemma 05] will be 
just the inverse of the expansion process from Definition 1361 

Lemma 45. Consider F £ MU with ic(F) = /j.vd(F ) (and thus n(F ) >0/ 

1. var^vd {F) is the set of all v £ var (F) which occur only in full clauses of F. 

2. fc(F ) is even. 

3. For v £ var MV( j (F) and F' := DP„(F) we have F' £ MU g _ s ^ fc ( f) f r 

4. ic(F ) < 2 • FCM {6(F) - + 1). 

Proof. Consider v £ var(f) with vd^(w) = /ivd(-F). The occurrences of v are 
now exactly in the full clauses of F (Part|TJ) . Every full clauses must be resolvable 
on v, and thus the full clauses of F can be partitioned into pairs {w} U C , {u} U C 
for C4C1 many clauses C. This shows Part[2] Parts[3j[4]now follow by considering 
F' := DP„(F): F' is obtained by replacing the full clauses of F by the clauses C 
(i.e. , performs a full subsumption resolution, which arc all strict except of the last 
one, which is non-strict). The new clauses C are full in F' (though there might 
be other full clauses in F'). Obviously F' £ MU and S(F') = 5(F) — + 1. 

□ 



For deficiency k = 7 we have the first case of FCH(fc) < FCM(fc): 

Lemma 46. FCM(7) = 9 = nM(7) - 1, while FCH(7) = 8 = S 2 ( 7). 

Proof. By S 2 (7) = 8 we have FCH(7) > 8. By Lemma 05l Part 0] and by 
FCM(3) = 4 the assumption of FCM(7) = 10 = nM(7) yields the contradic- 
tion 10 < 2FCM(7 — 5 + 1) = 2- 4 = 8, and thus FCM(7) < 9. By LemmaUwe 
obtain FCH(7) = 8. A clause-set F £ MUs=r with fc(-F) = 9 (and n(F) = 4) is 
given by the following variable-clause-matrix: 

+ 0\ 

+ + + - + -0 

+-+-+-0+++- 

V+ + + + + +0 

Let the variables be 1, ... ,4, as indices of the rows. Now setting variable 4 to 
false yields A3, where one non-strict subsumption resolution has been per- 
formed, while setting variable 4 to true followed by unit-clause propagation of 
{—3} yields A 2 . So both instantiations yield minimally unsatisfiable clause-sets, 
whence by [53] Lemma 3.15, Part 2] F £ □ 

We are ready to prove the final main result of this paper: 

Theorem 47. Table[I\ is correct. 



5) [23] Lemma 3.15] contains a technical correction over i2l| Lemma 1]. 
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Proof. The values for 1 < k < 6 have been determined in |23j Section 14]. We 
observe that 1, 2,4, 5, 6, 9, 11, 12, 13 e SAfMi, and thus by Theorem l42l nothing 
is to be done for these values, and only the deficiencies 7, 8, 10 remain. 

By Lemmal'151 Part[U we get that FCH( 8 ) = FCM( 8 ) = 10 (since nM( 8 ) = 11 
is odd), and also FCH(10) = FCM(10) = 12. By Lemma l46l it remains to provide 
unsatisfiable hitting clause-sets witnessing VDH(7) = 10, VDH( 8 ) = 11 and 
VDH(10) = 13. For deficiency 7 consider 

/OH 1 h H h\ 

p 0 1 — I 1 1 — h 

7 - + + + + + +0 • 

V + + +0+ + +/ 

F~j has 4 variables and 11 clauses, thus S(F 7 ) = 11 — 4 = 7; the hitting property is 
checked by visual inspection, and F 7 is unsatisfiable due to 8 - 2 _4 + 2 - 2 _ 3 -|- 2 _2 = 
- + i + i = 1 , while finally every row contains exactly one 0 , and thus F 7 is 
variable-regular of degree 10 = nM(7). 

Finally consider A4 with /(A 4 ) = 16 — 4 = 12 and p,vd(At) = 16: perform 
four strict full subsumption resolutions on variables 1 , 2 , 3, 4, and obtain elements 
of WHTT of deficiency 11, 10, 9 , 8 with min- var- degree 14, 13, 12, 11. □ 

8 Conclusion and Outlook 

In this paper we have improved the understanding of the four fundamental quan- 
tities, by supplying the lower bound S2 < FCH. The recursion defining S' 2 sheds 
also light on S2 = S ' 2 , and we gained a deeper understanding of S2 = 2 a 2 - 
Moreover we believe 

Conjecture j8. Vfc € N : 82(h) = FCH(fc). 

This would indeed give an unexpected precise connection of combinatorial SAT 
theory and elementary number theory. On the upper bound side, by Conjectures 
12.1, 12.6 in ]23 (see Figure 1 there for a summary of the relations between the 
four fundamental quantities) we get: 

Conjecture j9. Vfc € N : nM(/c) — 1 < FCM(fc) < VDM(fc) = VDH(fc). 

Recall that VDM(fc) < nM(fc); so we believe that three of the four fundamental 
quantities are very close to nM(fc). This is in contrast to nM(fc) — 82(h) being 
unbounded, and indeed 82(h) = k + 1 for infinitely many k (Corollary 14411 . 
while by Lemma 03] we also know 82(h) = nM(fc) for infinitely many fc, and 
thus S2 oscillates between the linear function k + 1 and the quasi-linear function 
nM(fc). To eventually determine the four fundamental quantities (which, if our 
conjectures are true, boil down to VDM and FCM, while VDH = VDM and 
FCH = S' 2 ), detailed investigations like those in Section [T] need to be continued. 

As FCH(fc) and 82(h) are closely related via (boolean) hitting clause-sets, via 
generalised (non-boolean) (hitting-) clause-sets (see |19I20| for the basic theory) 
we can establish a close connection to the S p (k) for all prime numbers p in forth- 
coming work (S p (k) is the smallest n £ No such that p k divides n!, introduced 
hr ]28] Unsolved Problem 49]). 
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